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Figure 1. Schematic representation of the preferential attachment process: The probability that a tie appears between node A and another node is 
a function of the number of ties of the other node. In this example, probabilities of new ties (in green) are indicated by line width. In this paper, we 
measure this relationship between the degree and the tie creation probability, modeling it as a power with an exponent whose values we explain by the 
processes underlying the network. 



ABSTRACT 

We perform an empirical study of the preferential attach- 
ment phenomenon in temporal networks and show that on 
the Web, networks follow a nonlinear preferential attachment 
model in which the exponent depends on the type of net- 
work considered. The classical preferential attachment model 
for networks by Barabasi and Albert (1999) assumes a lin- 
ear relationship between the number of neighbors of a node 
in a network and the probability of attachment. Although 
this assumption is widely made in Web Science and related 
fields, the underlying linearity is rarely measured. To fill 
this gap, this paper performs an empirical longitudinal (time- 
based) study on forty-seven diverse Web network datasets 
from seven network categories and including directed, undi- 
rected and bipartite networks. We show that contrary to the 
usual assumption, preferential attachment is nonlinear in the 
networks under consideration. Furthermore, we observe that 
the deviation from linearity is dependent on the type of net- 
work, giving sublinear attachment in certain types of net- 
works, and superlinear attachment in others. Thus, we in- 
troduce the preferential attachment exponent /3 as a novel nu- 
merical network measure that can be used to discriminate dif- 
ferent types of networks. We propose explanations for the 
behavior of that network measure, based on the mechanisms 
that underly the growth of the network in question. 
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INTRODUCTION 

The term preferential attachment refers to the observation 
that in networks that grow over time, the probability that an 
edge is added to a node with d neighbors is proportional to 
d. This linear relationship lies at the heart of Barabasi and 
Albert's scale-free network model [2], and has been used in 
a vast number of subsequent work to model networks, on- 
line and offline. The scale-free network model results in a 
distribution of degrees, i.e., number of neighbors of individ- 
ual nodes, that follow a power law with negative exponent. 
In other words, the number of nodes with d is proportional 
to d in these networks, for a constant 7 > 1. While a 
large amount of work has been done to verify empirically 
the validity of such power laws of the degree distribution, 
relatively little work has investigated whether the initial as- 
sumption of linear preferential attachment is valid. The only 
such study known to the authors is that of Jeong, Neda and 
Barabasi [29], which observes a preferential attachment func- 
tion that is a power of the degree with an exponent in the 
range [0.80, 1.05]. However, that study investigates only four 
network datasets representing only a small subset of network 
types encountered on the Web, and does not explain why the 
networks have a specific value of the exponent, which is cru- 
cial to better understand the dynamics and social processes 
that underlie preferential attachment, and thus the behavior of 
online networks in general. Due to the availability in recent 
years of a large number of independent network datasets cov- 
ering diverse aspects of the World Wide Web, we are able to 
study the behavior of forty-seven network datasets, and can 
interpret the observed values of the preferential attachment 



exponent in terms of the social processes underlying the indi- 
vidual networks. 

The contributions of this paper are: 

• We provide a method for measuring the preferential attach- 
ment exponent /? empirically in networks for which tempo- 
ral information is known. 

• We perform an extensive and systematic study of preferen- 
tial attachment in forty-seven onHne networks from seven 
different network categories. 

• We give interpretations for the observed values, showing 
that six out of the seven network categories display prefer- 
ential attachment behavior with values of the preferential 
attachment exponent consistently above or below the value 
p = l. 

• We interpret these findings in terms of social processes and 
explain why network categories feature such consistent be- 
havior. 

The remainder of the paper is structured as follows. In Sec- 
tion Related Work, we give a detailed review of growth mod- 
els leading to power law-like distributions, both for networks 
and non-network datasets. In Section Method, we introduce 
our method for empirically measuring the preferential attach- 
ment exponent of a given temporal network dataset. In Sec- 
tion Experiments, we apply our method to a collection of 
forty-seven temporal network datasets. Finally, in Section 
Explanations, we give explanations for the observed behav- 
ior. 

RELATED WORK 

The concept of preferential attachment has received ample 
attention in science, in particular network science [60]. Ac- 
companying the rise of the World Wide Web, a new genera- 
tion of studies was published. Aside from renewed interest in 
explanatory models (starting with Barabasi and Albert's sem- 
inal Science paper in 1999 [2], see also next section), social 
scientists started to use the concept in order to explain social 
processes. Preferential attachment is generally understood as 
a mechanism where newly arriving nodes have a tendency to 
cormect with already well-connected nodes [3, 43, 60]. 

Preferential attachment has been used to explain observa- 
tions in a variety of networks. For example, Lemarchand 
[34] and Wagner and Leydesdorff [58] explain evolving co- 
authorship networks based on preferential attachment. Sim- 
ilarly, Barabasi and colleagues [3] investigate collaboration 
networks in science and find that preferential attachment acts 
as a govemance mechanism in the evolution of these net- 
works. Also, Hanaki and colleagues [28] find that collabo- 
ration networks in the IT industry are significantly related to 
preferential attachment. Gay and Dousset [23] investigated 
the biotechnology industry, where new firms attach prefer- 
entially to older and "fitter", i.e., more successful, firms. A 
still emerging field of research is that of onUne networks, 
continuing the research stream started by Barabasi and Al- 
bert [2] more than a decade ago. For example, Tremayne 
and colleagues [56] investigated preferential attachment in 



the war blog network, where links from other blogs and re- 
porting posts were significant predictors. Most recently, Faraj 
and Johnson [21] have investigated open source software net- 
works and found, surprisingly, a tendency away from prefer- 
ential attachment. 

Power Laws and Related Distributions 

Power laws and related distributions such as the lognormal 
and Simon- Yule distributions can be understood as the re- 
sult of generative processes and are observed in many differ- 
ent areas, e.g., physics, biology, astronomy and economics. 
Kapteyn [30] and Gibrat [24] are recognized as two of the 
first scientists connecting generative processes with lognor- 
mal distributions, in 1916 and 1933 respectively. Champer- 
nowne [10] showed that a small change in the lognormal gen- 
erative process results in a generative process with a power 
law distribution. Yule [63] explained the observed power law 
distribution of species among genera of plants. A clean ex- 
planation how preferential attachment leads to a power law 
distribution was given by Simon [54]. Zipf |65| found that 
word frequencies follow a power law distribution and Lotka 
[38] showed the number of written articles by authors follows 
a power law distribution as well. 

Recent work on power law distributions is focused on graph 
and network structure, e.g., the World Wide Web, the Internet, 
collaboration networks and others in connection with pref- 
erential attachment mechanisms [1, 2, 6]. Finally, Newman 
[13, 45] outlined the challenges in measuring power law ex- 
ponents in real data. 

Nonlinear Preferential Attachment 

Traditionally, the preferential attachment function is assumed 
to be Unear, i.e., directly proportional to the node's degree. A 
natural generalization is to an arbitrary function of the node 
degree, in particular to sub- and superlinear functions [15, 22, 
33, 46]. 

Superlinear preferential attachment functions in trees are in- 
vestigated in [52]. The asymptotic distribution of degrees has 
the probability P = 1 for one vertex (the perpetual hub) and 
the probability P = for all other vertices. Physically, this is 
a gelation-like phenomenon. In [32] it is shown that a mod- 
erate superlinear preferential attachment function leads to a 
degenerate degree distribution in the thermodynamic limit, 
where one node receives almost all edges. However, in a 
wide range of the pre-asymptotic regime the degree distribu- 
tion follows a power law distribution. 

In the sublinear case of a preferential attachment function that 
is a power with an exponent between zero and one, a stretched 
exponential degree distribution is the result [15]. 

Measuring Preferential Attachment 

In most network studies, only a static snapshot of a network 
is available, and thus is is not possible to verify empirically 
whether preferential attachment takes place. Instead, most 
papers study the degree distribution of a network, and inter- 
pret its specific forms as evidence for preferential attachment. 
All references cited in the section Power Laws and Related 
Distributions fall into this category. 



The preferential attachment exponent itself is measured for 
several networks in [29], where all exponents were found 
in the range [0.8,1.0]. Newman [44] investigated a scien- 
tific collaboration network, finding a linear preferential at- 
tachment up to a degree « 500, and a sublinear preferential 
attachment beyond that value. These investigations show that 
a linear preferential attachment function is rarely observed in 
real world data, and that sub- and superlinear functions play 
a pivotal role in governing the network growing process at- 
tributed to the preferential attachment mechanism. 
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METHOD 

The concept of preferential attachment has slightly different 
interpretations depending on the type of networks considered. 
In order to distinguish these different cases, we classify the 
networks available to us into seven categories, depending on 
the underlying entities and relationships represented by ver- 
tices and edges. 

Although some networks are genuinely undirected, such as 
a friendship network, many networks represent asymmetric 
relationships that allow us to distinguish active and passive 
nodes, depending on the role they play in the creation of 
edges: 

• Some networks are directed, i.e., each edge possesses an 
intrinsic orientation from one node to another. An example 
is an email network, in which the nodes are persons and di- 
rected edges represent individual email messages. In such 
networks, the pointed-to node is the passive node and the 
pointing node is the active node. Thus, in the context of 
interpreting an edge as an attachment, it is the pointed-to 
node that receives the attachment. Therefore, we will only 
consider the indegree of nodes in these networks, i.e., the 
number of edges pointing to a given node. 

• Some networks are bipartite. A bipartite network contains 
two kinds of vertices, and all edges connect one node type 
with the other. In these networks, we can distinguish be- 
tween active and passive nodes. As an example, the user- 
song network where edges represent the "has listened to" 
relationship contains users (which are active) and songs 
(which are passive). We will thus only consider the set 
of passive nodes and their degrees in this case. 

Note that in these two cases, the resulting degree distribution 
of passive nodes tends to exhibit power-law behavior much 
more often than the active nodes, as shown in Figure 2 with 
two examples. In the following, we describe the seven cate- 
gories of networks we study. 

Social networks 

Social networks consist of persons connected by social ties 
such as friendship. The social networks we consider are based 
on online social networking sites and therefore, the consid- 
ered social ties are online contacts. Social networks allow 
only a single edge between a given node pair, i.e., multiple 
edges are not allowed. Some social networks have positive 
and negative edges, representing positive and negative ties 
such as friendship and enmity, or trust and distrust. In that 
case, we are only interested in the presence of a tie. Some 
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Figure 2. Examples of the degree distributions of active and passive 
nodes in one directed and one bipartite networli. The passive degree 
distributions are much nearer to a power law than the active degree dis- 
tributions, (a-b) The outdegree and indegree distribution of the directed 
social network of Twitter (Wa), (c-d) The left and right degree distribu- 
tion of the bipartite Filmtipset rating network (Fr). (a) and (c) are active 
degree distribution; (b) and (d) are passive degree distributions. 



social networks are undirected, and others are directed. For 
directed social networks, a directed edge from person A to 
person B means that person A is following or otherwise con- 
nected in an unilateral way with another person. These di- 
rected connections can of course be reciprocated, in which 
case two edges connect a given node pair in opposite direc- 
tion. Preferential attachment in a social networks results in 
the rule that people who already have many ties are more 
likely to receive new ties. In directed social networks, pref- 
erential attachment means that people who are followed by 
many people (i.e., are popular) are more likely to receive new 
followers. 

Rating networks 

A rating network is a bipartite network between persons and 
items they have rated. The nodes are persons and items, and 
each edge connects a person with an item, and is annotated 
with a rating. In the datasets we will consider, items can be 
movies, songs, products, jokes and even sexual escorts. Note 
that persons in rating networks are often called users, since 
the datasets are used in online recommender systems. The 
ratings values in rating networks will be ignored in our ex- 
periments. In other words, we consider only the information 
whether a person has rated an item. Rating networks usually 
only allow a single rating of a given item by a given person, 
although we also allow networks with parallel edges in this 
category. We also include in this category unweighted rating- 
like features such as persons liking items or marking them as 
favorites, since we ignore ratings anyway; in both cases an 
edge represents an endorsement (positive or negative) of an 
item by a person. We consider preferential attachment only 



of items. In other words, preferential attachment in rating 
networks refers to the fact that items with many items will 
receive more ratings in the future. Note that this statement is 
independent of the actual ratings given, but only refers to the 
information of whether a rating was or will be given. 

Communication networi^s 

A communication network consists of persons which ex- 
change information in the form of individual messages. Since 
each message is represented by an edge, multiple edges con- 
necting two persons are allowed. Edges in communication 
networks are always directed and can represent emails, other 
types of messages in social media such as "wall posts" in 
Facebook, or replies to another person in online forums. Pref- 
erential attachment in communication networks refers to the 
observation that persons who have already received many 
messages are more likely to receive messages in the future. 

Folksonomies 

Folksonomies consist of a set of tag assignments, which are 
person-tag-item triples that denote that a given person has as- 
signed a given tag to a given item. Items can be as diverse as 
websites, movies or scientific papers. Tags are strings which 
are intended to describe or classify the item. We consider 
two types of preferential attachment types in folksonomies: 
preferential attachment on tags and preferential attachment 
on items. Preferential attachment on tags refers to the obser- 
vation that tags which have been used in many tag assign- 
ments are more likely to be used in new tag assignments. 
On the other hand, preferential attachment on items refers to 
the observation that items which have been tagged often are 
more likely to receive tag assigimients in the future. We will 
distinguish the two cases by simply considering them as two 
different bipartite networks: the person-tag network and the 
person-item network. 

l/l//7(/ edit networl<s 

Wiki edit networks are bipartite networks between users of 
wikis and the pages they edit, where each edge denotes a sin- 
gle edit. Wiki edit networks thus allow multiple edges be- 
tween a user-page combination. All wikis considered are 
Wikimedia sites such as Wikipedia, Wiktionary, etc. Pref- 
erential attachment in edit networks refers to the observation 
that pages which have received many edits are more likely to 
receive many edits in the future. Note that wiki edit networks 
are part of the more general category of authorship networks, 
but that for usual (non-wiki) works, the set of authors is fixed 
and thus preferential attachment is impossible. 

Explicit interaction networl<s 

An explicit interaction network consists of people and inter- 
actions between them. Examples are people that meet each 
other, or scientists that write a paper together. Explicit inter- 
action networks are unipartite and allow multiple edges, i.e., 
there can always be multiple interactions between the same 
two persons. They can also be both directed or undirected. 
Preferential attachment in explicit interaction networks refers 
to the observation that people who have had many interac- 
tions with other people in the past are more likely to have 
interactions in the future. 



Implicit interaction networks 

Implicit interaction networks are networks where the inter- 
action between people is not encoded in direct ties between 
them, but in indirect ties through things with which people in- 
teract. Thus, an implicit interaction network is a bipartite net- 
work consisting of people and things, in which each edge rep- 
resents an interaction. Examples of implicit interaction net- 
works are people writing in forums, commenting on movies 
or listening to a song. In implicit interaction networks, we 
always allow multiple edges between the same person-thing 
pair. Preferential attachment in implicit interaction networks 
refers to the observation that things which have been inter- 
acted with many times in the past are more likely to receive 
interactions in the future. 

Definitions 

Let G = {V, E) be an undirected, unweighted network al- 
lowing multiple edges, in which V is the set of nodes and E 
is the multiset of edges. The number of neighbors of a node 
u&V is called its degree and is defined as 

d{u) = \{{u,v} gE\vG V}\. 

We explicitly allow multiple edges between two nodes, and 
count them separately in this definition of the degree. We 
ignore all edge weights, such as ratings in rating networks or 
the positive and negative signs of signed networks. We also 
allow loops, i.e., edges from a node to itself, as these may 
appear for instance in email networks when people send an 
email to themselves. 

In order for preferential attachment to be observed directly 
(as opposed to observing the resulting degree distribution), 
we need to know the evolution of a given network. Thus, 
we need to know at which time each edge was added to the 
network and can then consider the evolution of the network 
as a function of time. We thus need to observe a temporal 
network at an intermediate time ti and at the latest known 
time t2- In this paper, we will choose ti such that at the time 
ti, 75% of all edges have been added to the network. This 
value is chosen such that is corresponds to the split used in the 
link prediction studies between known and unknown edges 
[36]. Note that preferential attachment has been exploited 
for implementing link prediction, under the same conditions 
we use here. For a given network G = {V,E), let Gi = 
{V,Ei) be the network with the same vertex set as G, and 
containing all edges created before ti. Denoting by t{{u, v}) 
the creation time of an edge {u, f }, we have 

El = {{u,v} e E I ti{u,v}) < h}. 

Let di{u) be the degree of a node in Ei, i.e., 

di{u) = \{{u,v}eEi\veV}\. (1) 
Preferential Attacfiment Functions 

We can now give the definition of a preferential attachment 
function. A preferential attachment function is a function that 

maps the number of edges at time to the number of news 
edges received after ti.ln other words, a function / such that 
/(rfi(u)) approximates d2{u) = d{u) — di{u) for all nodes 



u e V. The values returned by a preferential attachment 
function / will be called attachment values. 

Different network growth models can be expressed in terms 
of the preferential attachment function / they are based on. 
We will consider all functions only up to a constant factor, 
since we are only interested in the relative attachment values 
of different vertices. 

• f{d) = 1. In this model, the attachment is independent 
of the degree. This growth models leads to networks in 
which all edges are equally hkely independently from each 
other, i.e., the Erdos-Renyi model [20]. These networks 
are usually simply called random graphs. 

• f{d) = d. In this model, preferential attachment is linear. 
This corresponds to the Barabasi-Albert model of scale- 
free networks [2]. 

• f{d) = d^ . In this model, the attachment is an arbitrary 
power of the degree [15, 22]. 

• /(d) = (1 + d)^ . This model modifies the previous one in 
that it gives a nonzero attachment value even to nodes of 
zero degree [29]. 

In this paper, we will use the slightly more general form 
/(d) = e"(l + d)^-A 

^ ga+/31n(l+d) _ 

in which A is a regularization parameter, whose purpose will 
become clear in the following. The exponent (5 will be called 
the preferential attachment exponent. The parameter a is a 
multiplicative term that we can ignore since values of / are 
to be interpreted only up to a constant factor. 

Generalization of Previous lUlodels 

Individual graph growth models can be recovered by set- 
ting the parameter /3 in the preferential attachment function 
f{d) — d^ to specific values. 

(a) Constant case ^ = 0. This case is equivalent to a con- 
stant function f{d), and thus this graph growth model results 
in networks in which each edge is equally likely and inde- 
pendent from other edges. This is the Erdos-Renyi model of 
random graphs [20] . 

(b) Sublinear case < ^ < 1. In this case, the preferential 
attachment function is sublinear. This model gives rise to a 
stretched exponential degree distribution [15], whose exact 
expression is complex and given in [17, Eq. 94]. 

(c) Linear case /3 = 1. This is the scale-free network model 
of Barabasi and Albert [2], in which attachment is propor- 
tional to the degree. This gives a power law degree distribu- 
tion. 

(d) Superlinear case j3 > I. In this case, a single node 

will acquire 100% of all edges asymptotically [52]. Networks 
with this behavior will however display power law degree dis- 
tributions in the pre-asymptotic regime [32]. 



Curve Fitting 

We now describe our method for estimating the value of the 
parameter /3. Since the values of the degree d span sev- 
eral orders of magnitude, a simple least- squares curve fitting 
would give highly skewed results, as it would drastically over- 
weight high degrees. Therefore, we perform a least square fit- 
ting on the logarithmic degrees. The following minimization 
problem gives an estimate for the exponent /9. 

min y (a + I3\u[l + rfi(u)] - ln[A + d2{u)]f (2) 

The resulting value of fi is the estimated preferential attach- 
ment exponent. Note that due to the shift term of one and the 
regularization parameter A, our model can both accommodate 
nodes with degree zero at time t\, as well as nodes that do not 
receive any new Unk between ti and t2. 

To measure the error of the fit, we define the root-mean- 
square logarithmic error e in the following way: 



6 = exp I ^ 1^ E (« + /31n[l + - ln[A + d2{u)]f | 

This gives the average factor by which the actual new number 
of edges differs from the predicted value, computed logarith- 
mically. The value of e is larger or equal to one by construc- 
tion. 

EXPERIMENTS 

We compute an estimation of the preferential attachment ex- 
ponent j3 for forty-seven network datasets. All networks 
are taken from the Koblenz Network Collection (KONECT, 
konect.uni-koblenz.de). The full description of the networks 
can be read on the KONECT website' . The networks we use 
in our experiments fulfill the following criteria: 

• Creation times are known for all edges. 

• We exclude very incomplete datasets, in which the degree 
distributions are skewed by the samphng method used to 
generate the data. 

• We exclude networks that are too small, i.e., have less than 
10,000 edges. 

Table 1 shows the list of datasets used in our experiments. 
Methodology 

For each network, we split the set of edges into the set 
of old edges Ex and the set of new edges E \ Ei a& de- 
scribed in Equation (1). Then, we compute the old degree 
di(u) for all nodes u & V and the number of new edges 
d2{u) = d{u) — d\{u). We then solve the least-squares min- 
imization problem of Equation (2), giving an estimate of the 
preferential attachment exponent /3. 

The regularization parameter A is set to 0.1 in our experi- 
ments. 

'konect.uni-koblenz.de/networks 



Table 1. The list of network datasets used in tliis paper. Flags: U = 
Unipartite, D = Directed, B = Bipartite, M = Multiple edges. \V\ refers 
to the number of nodes in the network; \E\ refers to the number of edges 
in the network. In all networks, edges are annotated with edge creation 
times. 
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Figure 4. The preferential attachment exponent 13 and the root-mean- 
square logarithmic error e. Each two or three letter code represents one 
network dataset. The codes are given in Table 1. The color of the codes 
represent the network category. 



Experimental Results 

To illustrate the curve fitting procedure, we show the mean 
standard deviation of the logarithmic degree d2 = d{u) — 
di{u) as a function of di{u). Figure 3 shows this plot, along 
with the fitted curve, for the largest networks of six different 
network categories. 

The root-mean-square logarithmic error e is shown together 
with /? in Figure 4. 

The measured preferential attachment exponents /3 for all net- 
works are shown in Figure 5. The estimates for the power 
law exponent 7 are computed using the robust method given 
in [45, Eq. 5]. We note that the estimated degree distribution 
exponents lie in the approximate range [1,2.5], and are thus 
smaller than the usually cited range [2, 3] would suggest. 

In the case of superlinear attachment, the degree distribution 
is predicted to converge over time to a state in which a single 
node dominates all other nodes, i.e., in which a single node 
has 100% of all inlinks asymptotically. Let dmax be the de- 
gree of the node with most links in the networks. Then, to 
test whether such nodes are present in the studied networks. 
Figure 6 shows the ratio ln(o?max)/ ln(| V|) plotted against 
the preferential attachment f3. The plots exhibit a moderate 
agreement of the super- versus the sublinear cases for net- 
works in which 1.3 < /3 < 1.5 (such as RM, TH, Ls, Lb, 
PH, HA) and for 0.4 < /3 < 0.7 (like Cui, Bui, nen, ER, 
DG, AR, nfr). If (3 is close one (i.e., in the case of weak sub- 
linear or weak superlinear attachment), the agreement breaks 
down. 

DISCUSSION 

We have empirically investigated preferential attachment in 
forty-seven online networks and found that these networks 




(a) Flickr (b) Yahoo Songs (c) Enron 




(d) CiteULike user-publication (e) English Wikipedia (f) Last.fm song 



Figure 3. The mean number of new edges d2 (u) = d{u) — di (u) as a function of di (u) in the largest networks of six network categories, (a) Epinions 
trust (EP), a social network (b) Yahoo Songs (YS), a rating network (c) Enron (EN), a communication network (d) CiteULike user-pubUcation (Cui), 
a folksonomy, (e) English Wikipedia (en), a wiki edit network, (f) Last.fm song (Ls), a implicit interaction network. The bars indicate the logarithmic 
standard deviation measured over all nodes with the same value ofdi{u). The line represents the fitting curve /(d) = e"(l + di)*^ A. The standard 
deviation and mean on the plot is shown for illustration; it is not used in the fitting procedure. The actual curve fitting is performed by solving the 
optimization problem in Equation 2. For small values of di (m), the standard deviation is small due to the high number of nodes having low degree. For 
larger values of di (u), the standard deviation becomes higher due to the reduced number of nodes with high degree. For very large values of di {u), 
only one node has a given degree, and the standard deviation is undefined. 



follow a nonlinear preferential attachment model, i.e., their 
preferential attachment exponent is either larger than one (su- 
perlinear) or lower than one (sublinear). As such, we chal- 
lenge the often implicit assumption in Web Science that pref- 
erential attachment assumes a linear relationship (cf. [2]). 
Furthermore, we show that certain clearly distinct categories 
of online networks feature a superlinear preferential attach- 
ment exponent, whereas other categories feature a sublinear 
one. Our findings point out that previous studies of preferen- 
tial attachment in online networks might have oversimplified 
the underlying mechanisms by assuming linearity when, in 
fact, most online networks follow a nonlinear pattern. 

In particular, we find that the majority (70%) of the stud- 
ied online networks fall into the sublinear category, having 
f3 < 1. The networks in the sublinear category were pre- 
viously classified as rating, communication, folksonomy and 
social networks (see also Table 1). Also, a subset of the au- 
thorship networks falls into this category, specifically all wiki 
edit networks except those from Wikipedia (with the excep- 
tion of the French Wiktionary). The other 30% fall into the 
superlinear category where /3 > 1. These networks were clas- 
sified as explicit and implicit interaction networks. Also, the 



second subset of the authorship networks falls into this cate- 
gory, specifically all Wikipedias and the French Wiktionary. 

Our findings show that online networks do not follow a linear 
preferential attachment model. Actually, not one of the stud- 
ied networks featured linearity where (3 = 1 exactly. This 
is unexpected, as most literature implies such linearity. In 
addition, we show that online networks are also not consis- 
tent in their (non-)linearity: most networks follow a sublinear 
preferential attachment model, whereas others follow a su- 
perlinear model. However, we do find patterns that suggest 
an underlying internal consistency, because most of the pre- 
vious classifications fit in their entirety into one category (ex- 
cept the wiki edit networks). For example, all networks in the 
'rating networks' classification are part of the sublinear cate- 
gory, whereas all interaction networks fall into the superlinear 
category. 

Furthermore, we observe that similar to the distribution of 
the preferential attachment exponent, the power law exponent 
too is far from consistent as suggested by previous literature 
[1]. We find that while the values differ quite extremely, the 
distribution does not seem to follow a distinct pattern, nor 
is there a clear correlation with the preferential attachment 
exponent (see Figure 5). 
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Figure 5. The preferential attachment exponent /3 plotted against the 
power law exponent 7. Each two or three letter code represents one net- 
work dataset. The codes are given in Table 1. The color of the codes 
represent the network category. The estimates for the power law expo- 
nent 7 are computed using the robust method given in [45, Eq. 5]. 



An explanation of these findings might be that some net- 
works, in particular the ones falling into the superlinear cat- 
egory, follow other or at least additional governance mech- 
anisms than other networks [47]. For example, authors on 
Wikipedia (which is part of the superlinear category) might 
perceive the existence of a strong internal normative system 
that prescribes their online behavior [50]. Indeed, previous 
research has often observed the existence of social norms and 
normative systems in online networks (e.g., [16, 42, 59, 64]). 
This phenomenon is often explained by the absence of more 
formal and explicit governance mechanisms (e.g., [14, 26]) 
that are typically observed in other types of networks. 

CONCLUSION 

The findings presented in this paper show that interaction 
in online networks might be more complex than previously 
thought. In particular, we show that these networks follow a 
nonlinear preferential attachment model, contrary to what is 
suggested in the literature. Similarly, most of the networks 
that we studied have a power law exponent that is not even 
close to being consistently in the range 7 e [2,3] [1]. This 
leads to the conclusion that research into online networks 
might need to take into account other factors, and most impor- 
tantly employ different models that allow for the nonlinearity 
of the preferential attachment model. Also, the previous as- 
sumption of a more or less generalized range of 7 e [2,3] 
for the power law exponent seems to be challenged, as we 
observed variation in that value across the networks. 
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The codes are given in Table 1. The color of the codes represent the 
network category. 



Our work suggests a number of future research directions. 
First, as a direct consequence of our empirical findings, we 
suggest that future work should develop new models to al- 
low for nonlinearity of the preferential attachment exponent, 
as well as diversity of the power law exponent. Our findings 
undermine many of the previously developed models, as such 
we expect fruitful research in that direction. Second, and re- 
lated to our first point, future research should search for ex- 
planations for our findings. In the previous section, we tenta- 
tively highlight some possible explanations; however, empiri- 
cal studies need to establish their value. In particular, research 
should connect more mathematical approaches to study on- 
line networks (such as presented in this paper) and sociolog- 
ical attempts to explain the observed phenomena. For exam- 
ple, if the type of governance in a given network indeed in- 
fluences its preferential attachment and power law exponent, 
how exactly does that mechanism work? Related to this ques- 
tion it is important to investigate the emergence of networks, 
something that we did neglect in the current research. If we 
assume sociological mechanisms to play a role in the explana- 
tion of diversity and nonlinearity of the two exponents, then it 
follows that the antecedents of these mechanisms need to be 
investigated. For example, how does a certain type of gov- 
ernance in an online network come into being? What are 
the driving forces behind this emergence, and how can these 
mechanisms best be studied? We hope that our paper con- 
tributes to fuel research into that direction. 

Our study is subject to a number of limitations that present 
opportunities for future research. First, we do find that the 
preferential attachment exponent is nonlinear, similarly the 



power law exponent is distributed more diversely than ex- 
pected. However, we do not investigate the relationship be- 
tween these two observations, and suggest that future work 
further delves into this issue. Second, we investigated forty- 
seven datasets. Future research might broaden the scope of 
our study to include more and more diverse online networks. 
For example, in the current study we did not investigate net- 
works such as hyperUnk networks and affiliation networks. It 
should be fruitful to test the observations that we make in this 
paper on a larger scale, and as such generalize our findings to 
a broader level. 
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